Scalability of Parallel Sorting on Mesh Multicomputers
نویسندگان
چکیده
This paper presents two new parallel algorithms QSP1 and QSP2 based on sequential quicksort for sorting data on a mesh multicomputer, and analyzes their scalability using the isoeeciency metric. We show that QSP2 matches the lower bound on the isoee-ciency function for mesh multicomputers. The isoef-ciency of QSP1 is also fairly close to optimal. Lang et al. and Schnorr et al. have developed parallel sorting algorithms for the mesh architecture that have either optimal (Schnorr) or close to optimal (Lang) run-time complexity for the one-element-per-processor case. Both QSP1 and QSP2 have worse performance than these algorithms for the one-element-per-processor case. But QSP1 and QSP2 have better scal-ability than the scaled-down variants of these algorithms (for the case in which there are more elements than processors). As a result, our new parallel formulations are better than these scaled-down variants in terms of speedup w.r.t the best sequential algorithms. We also present a diierent variant of Lang's sort which is asymptotically as scalable as QSP2 (for the multiple-element-per-processor case). We brieey discuss another metric called \resource consumption metric". According to this metric, both QSP1 and QSP2 are strictly superior to Lang's sort and its variations .
منابع مشابه
Executing Algorithms with Hypercube Topology on Torus Multicomputers
Many parallel algorithms use hypercubes as the communication topology among their processes. When such algorithms are executed on hypercube multicomputers the communication cost is kept minimum since processes can be allocated to processors in such a way that only communication between neighbor processors is required. However, the scalability of hypercube multicomputers is constrained by the fa...
متن کاملEmbedding Hypercubes onto Rings and Toruses
Many parallel algorithms use hypercubes as the communication topology among processes. When such algorithms are executed on hypercube multicomputers the communication cost is kept minimum since processes can be allocated to processors in such a way that only communication between neighbor processors is required. However, the scalability of hypercube multicomputers is constrained by the fact tha...
متن کاملThe Scalability of FFT on Parallel Computers
In this paper, we present the scalability analysis of parallel Fast Fourier Transform algorithm on mesh and hypercube connected multicomputers using the isoefficiency metric. The isoefficiency function of an algorithm architecture combination is defined as the rate at which the problem size should grow with the number of processors to maintain a fixed efficiency. On the hypercube architecture, ...
متن کاملParallel Triangulation of a Set of Points for Coarse Grained Multicomputers
In this study we address the problem of efficient parallel triangulation methods for a finite set of points in the plane. The main goals of the research were to identify scalable algorithms which achieve a significant speedup over the sequential solutions and to implement and evaluate their performance on a parallel machine. Two approaches for parallel triangulation, one of which relies on glob...
متن کاملEffective Load Balancing on Highly Parallel Multicomputers Based on Superconcentrators
Tree and mesh architectures have been considered as two of the most highly scalable parallel multicomputers due to their scalabilities are superior to that of hypercubes. However, the load balancing on these two multicomputer systems are not so well as we expected. The worst case of tree architecture requires O ( M x p x logp) routing time for redistributing the workload over the system and it ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1991